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Abstract 

The two-user multiple-input multiple-output (MIMO) fast-fading interference channel (IC) with an arbitrary 
number of antennas at each of the four terminals is studied under the settings of Shannon feedback, limited Shannon 
feedback, and output feedback, wherein all or certain channel matrices and outputs, or just the channel outputs, 
respectively, are available to the transmitters with a finite delay. While for most numbers of antermas at the four 
terminals, it is shown that the DoF regions with Shannon feedback and for the limited Shannon feedback settings 
considered here are identical, and equal to the DoF region with just delayed channel state information (CSIT), it 
is shown that this is not always the case. For a specific class of MIMO ICs characterized by a certain relationship 
between the numbers of antennas at the four nodes, the DoF regions with Shannon and the limited Shannon 
feedback settings, while again being identical, are strictly bigger than the DoF region with just delayed CSIT. To 
realize these DoF gains with Shannon or limited Shannon feedback, a new retrospective interference aUgnment 
scheme is developed wherein transmitter cooperation made possible by output feedback in addition to delayed 
CSIT is employed to effect a more efficient form of interference alignment than is feasible with previously known 
schemes that use just delayed CSIT. The DoF region for just output feedback, in which each transmitter has delayed 
knowledge of only the receivers' outputs, is also obtained for all but a class of MIMO ICs that satisfy one of two 
inequalities involving the numbers of antennas. 

Index Terms 

Degrees of freedom, delayed CSIT, feedback, interference channel, interference aUgnment, MIMO, Shannon 
feedback. 
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I. Introduction 

THE characterization of the capacity of channels with feedback, where the channel outputs are known to the 
transmitter(s) with a finite delay, is a classical problem in information theory. For example, it is well known 
that feedback can not increase the capacity of a memoryless point-to-point channel |1|. Moreover, feedback can 
not increase the capacity of a point-to-point channel with additive, correlated Gaussian noise by more than one bit 
||T|. Interestingly, multi-user channels exhibit a different behavior. In particular, feedback can enhance the capacity 
of even the memoryless multiple access channel (MAC) |[2|, |[3| but again this improvement is bounded for the 
Gaussian MAC [3|. There has also been much interest in characterizing the capacity region of other memoryless 
networks with feedback such as the broadcast channel (BC) Q. However, due to the apparent intractability of such 
problems for more complex topologies, capacity approximations have been sought. Of these approximate capacity 
metrics, the degrees of freedom (DoF) region has received considerable attention. The DoF region denotes the 
rate of growth of the capacity region with respect to the logarithm of the signal-to-noise ratio (SNR) in the limit 
of asymptotically high SNR. For example, it can be deduced from [5] that for the 2-user Gaussian MIMO IC 
output feedback can not enhance the DoF region when there is perfect and instantaneous CSIT. In |6], the feedback 
capacity region is characterized to within a constant gap of 2 bits for the single-antenna (or SISO) IC. Further, it is 
well known that feedback fails to improve the DoF regions of the Gaussian MIMO MAC and the Gaussian MIMO 
BC. It is not yet known if there are networks with instantaneous CSIT for which (output) feedback enhances the 
DoF region. 

While the DoF of networks under the idealized assumptions of perfect, often global, and instantaneous CSIT have 
been well studied for numerous networks, the much more conservative setting of isotropic fading with transmitters 
having no CSIT has recently been extensively studied as well |[7|-p4|. Networks without CSIT but with (output) 
feedback have also been considered from which it is known that in the absence of CSIT feedback can enhance the 
DoF regions of the K-user BC ||15|-y/71, the 2-user SISO X channel [18], and 3-user SISO IC ||10j|, |^18J. Thus, 
unlike the instantaneous CSIT case, feedback can be beneficial even from the DoF perspective when there is no 
CSIT. This suggests that the benefit of feedback depends critically on the availability of CSIT since it is vastly 
different at the two extremes of having instantaneous CSIT and having no CSIT whatsoever. 

Moving beyond models that are either too idealized on the one extreme, or too conservative on the other, we 
consider here the delayed CSIT model wherein the channel state varies independently across time and the transmitters 
know perfectly the past channel states (cf. | |T5[ , 1 16|, 1 18 1). For such a setting we investigate the question of whether 
output feedback can improve the DoF region. Clearly, this question can be definitively answered only for networks 
for which the DoF with (just) delayed CSIT are known, of which there are but few. Of all networks for which 
the DoF are known except for the MIMO IC, this question has so far been answered in the negative. In particular, 
it is known that with delayed CSIT output feedback can not increase (a) the sum-DoF of the K-user MISO BC 
with at least K transmit antennas 1 15 1, (b) the DoF region of the 2-user MIMO BC |16|, (c) the sum-DoF of the 
3-user MIMO BC with N antennas at all three receivers and at most 2A^ antennas at the transmitter y/TJ and (d) 
the 2 X 2 X 2 interference network [19]. 

The only other exact characterization for the DoF region with (just) delayed CSIT is provided by the authors in 
[|20l for the two-user MIMO IC with an arbitrary number of antennas at each of the four terminals. Consequently, 
by obtaining the complete DoF region of this general two-user MIMO IC with delayed CSIT and output feedback 
(i.e., under Shannon feedback), and showing that for some cases there is a strict enhancement of the DoF region 
over that with just delayed CSIT, we answer the question of whether output feedback can enhance the DoF region 
of a network with delayed CSIT for the first time in the affirmative. 

In particular, it is shown here that if Mj and Ni, respectively, denote the number of antennas at the i^^ transmitter 
and the i^^ receiver of a two-user MIMO IC, then the DoF region with Shannon feedback is strictly bigger than 
the corresponding delayed-CSIT DoF region, if and only if one of the two inequalities, namely, 

Mi> N1+N2- M2 > Ni> N2> M2 > N2^ 

iVi — M2 

or its symmetric counterpart (obtained by switching the user indices), holds. For MIMO ICs for which neither of 
these two inequalities holds output feedback does not improve the delayed CSIT DoF region. 

To derive our main result, we first obtain an outer-bound to the DoF region with Shannon feedback. This outer 
bound is then shown to be achievable for all but the above described class of MIMO ICs using just delayed 



3 




Fig. 1. The (Mi, M2, A^i, 7V2) MIMO IC with Shannon Feedback 



CSIT. For the class where the DoF region is strictly larger than that with delayed CSIT, we develop a new 
retrospective interference alignment scheme in which each transmitter - using the side information available to it - 
reconstructs and transmits the previously transmitted signal of the other transmitter to provide an opportunity to its 
paired receiver to cancel the interference it encountered at a previous time instant, while simultaneously delivering 
new useful linear combinations to the unpaired receiver. Consequently, Shannon feedback induces a new form of 
transmitter cooperation which is key to realizing the DoF gains attainable with Shannon feedback over that with 
just delayed CSIT. 

Moreover, it is seen that to achieve this more efficient interference alignment all of the channel matrices and 
outputs are not needed at both transmitters. In particular, two limited Shannon feedback settings are described that 
are sufficient to achieve the DoF region with Shannon feedback. It is also observed that if in addition to delayed 
CSIT the feedback is allowed to be some designable function of past channel outputs and states, a setting that is 
more optimistic than Shannon feedback, the DoF region doesn't expand over that of the DoF region in the Shannon 
feedback case. Furthermore, with just output feedback without any form of CSIT, it is shown that the DoF region 
is the same as that for delayed CSIT with the exception of a class of MIMO ICs characterized by one of two 
inequalities involving the numbers of antennas at the four terminals. For this class, the DoF region remains an open 
problem at this time. 



The rest of the paper is organized as follows. Section II-A describes the model of MIMO IC under various 
assumptions about feedback, states the main results of this work on the DoF regions under these assumptions and 
provides an example of the new retrospective interference alignment scheme. Proofs of the results are contained in 
Sections llll] and llV] and the appendix. 



II. Channel Models, Main Results, and IA with Shannon Feedback 
In Section II-A[ the MIMO IC model with Shannon feedback, limited Shannon feedback, output feedback and 



designable Shannon feedback are described. Section II-B contains the main results on the DoF regions under these 



settings. In Section II-C[ we discuss how interference alignment can be achieved with Shannon and limited Shannon 



feedback. Section II-D provides some insight on the main results. 



A. The MIMO IC with Shannon Feedback 

The MIMO IC consists of two transmitters, denoted as Tl and T2, and their corresponding receiver, labeled Rl 
and R2, respectively, as in Fig. [T] The (Mi, M2, A^i, A''2) MIMO IC is defined via the input-output relationships 

Y^{t) = Hii{t)Xi{t) + Hi2{t)X2{t) + Wi{t), (1) 

Y2{t) = H2l{t)Xi{t) + H22{t)X2{t) + W2{t), (2) 

where, at time t, Yi{t) € C^'^^ is the signal received by the i*'^ receiver; Xi{t) € C^^*^^ is the signal transmitted 
by the i*'' transmitter; Hij{t) G ([^n.xM, jj^g channel matrix between the i^^ receiver and the j^^ transmitter; 
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Fig. 2. Comparison of the DoF regions of a MIMO IC 



Wi{t) £ C^'^^ is the additive white Gaussian noise at the i*'* receiver; and there is a power constraint of P on 
the transmit signals, i.e., E||Xj(t)|p < Py i,t. 

For simplicity, we study here the case of additive white Gaussian noise, i.e., all entries of Wi{t), i = 1,2, are 
independent and identically distributed (i.i.d.) according the complex normal distribution with zero-mean and unit- 
variance (denoted, henceforth, as CJ\f{0, 1)). Further, we assume that the channel matrices are i.i.d. Rayleigh faded, 
i.e., all elements of all channel matrices are i.i.d. according to CJ\f{0, 1) distribution (denoted as i.i.d. ~ CJ\f{0, 1)). 
Next, it is assumed that the realizations of additive noises and channel matrices are i.i.d. across time and that they 
are mutually independent of each other. 

Throughout this paper, both receivers are taken to know all channel matrices perfectly. Since there is no delay 
constraint on decoding it is assumed, without loss of generality, that CSI is available instantaneously to the receivers. 
Further, all terminals are always assumed to know the distribution of the channel matrices. 

We start by defining the term Shannon feedback (cf. [18]) and later consider other types of feedback. Here, 
the two transmitters are assumed to know the channel matrices and the channel outputs perfectly with a finite 
delay. This delay is taken to be of 1 symbol time without loss of generality. In particular, the channel matrices 
{Hij(t)}j and the channel outputs Yi{t) and Y2{t) are taken to be known perfectly to both transmitters at time 
t + 1. 



Notation: For n > 0, H{n) = -H'i2(t), i?2i(t), i?22 if n > 1 and H{n) 



if n = 0. Similarly, for each i E {1, 2}, Yi 



n) 



if n > 1 and Yi 



n 



if n 



0. 



some constant) 



Let A4i and 7W2 be two independent messages to be sent by Tl and T2, respectively, over a block length of n, 
where the message Aii is intended for the z*'' receiver. It is assumed that A4i is distributed uniformly over a set 
of cardinality 2"^*^^), Ri{P) > 0, when there is a power constraint of P at the transmitters. A coding scheme for 

pW_/^(n)^n 1,2, such that 



l),Y2{t-l)) withEl |2 <PVi,t, 



blocklength n consists of two encoding functions f- 

X^{t) = f^f{M^,H{t-l),Yi{t 
and two decoding functions such that 

Mi = g'f^HYi{n),H{n) ) where i e {1,2}. 



A rate tuple (i?i(P), i?2(P)) is said to be achievable if there exists a sequence (over n) of coding schemes such 
that probability oi Mi ^ Kii or M2 tends to zero as n — )• 00. 

The capacity region C{P) is defined to be the set of all achievable rate tuples i?2(P)) when the power 

constraint at the transmitters is P. The DoF region with Shannon feedback is defined as 



V {wi,W2) G 



widi +W2d2 < limsup - — - 



sup \^WiRi{P) + W2R2{P)] 

{R^iP),R2(P))<=C{P) 
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where denotes the set of pairs of non-negatives real numbers, and limsupp^o^ stands for the limit superior 
[ [21) as P — 00. It can be easily proved that the DoF region is closed | [2l| and convex |[l|. 
Consider next the MIMO IC under four other settings defined below. 

• Designable Shannon feedback: both transmitters know the channel matrices {Hij{t)}ij and modified channel 
outputs Yi{t) and Y2{t) at time t + 1 (in general, with some finite delay), where, for each i G {1,2}, 
Yi{t) G C^'^^ is a deterministic function of Yi{t) and 

• Limited Shannon feedback of Type T. each transmitter knows the other receiver's incoming channels and 
outputs with some delay (which, without loss of generality we take to be 1 time unit), i.e., the i^^ transmitter 
knows the channel matrices Hji{t) and Hjj{t) and the received signal Yj{t) all at time t + 1, for each 
(i,i)G{(l,2),(2,l)}. 

• Limited Shannon feedback of Type 2: each transmitter is provided at each time its own receiver's outputs as 
well as the four channel matrices, all with some delay (which, without loss of generality we take to be 1 time 
unit); i.e., the i*'^ transmitter knows Yi{t) and {Hjk{t)}'j i^^^ at time t + 1. 

• Output feedback: both transmitters know the channel outputs Yi{t) and Y2{t) at time t + 1 (or, in general, 
with a delay of finite number of time slots) but they have no knowledge of channel matrices whatsoever. 

The DoF regions of the MIMO IC under these settings can be defined in analogous manner to that with Shannon 
feedback, and are denoted, respectively, as D'^^, D'^^, D'^^, and D°p. Since the designable Shannon feedback 
setting is stronger than that of Shannon feedback and the limited Shannon feedback and output feedback settings 
are weaker, we have that 

Furthermore, the conditions of delayed CSIT, instantaneous CSIT, and instantaneous CSIT with output feedback 
are defined as follows: 

• delayed CSIT: the channel matrices {Hijit)}^ j=i are known to the transmitters at time t + I; 

• instantaneous CSIT: the channel matrices {-f^jj(i)}?j=i are known to the transmitters instantaneously (i.e., at 
time t); and 

• instantaneous CSIT and output feedback: the channel matrices {Hij{t)}j are known to the transmitters at 
time t, and additionally, they know the channel outputs Yi{t) and Y2{t) at time t + 1. 

The DoF regions corresponding to these three assumptions can again be defined analogously, and are denoted, 
respectively, as D^^^^^ D'^^^ and D'^s^'^°p. Clearly, 

j-jdCSI (- j-jiCSI -qS (- -piCSI&op 



B. Main Results 

The characterization of D'CSifeop ^^g^Q^^ asserts that, in the presence of instantaneous CSIT, output feedback can 
not improve the DoF region. 

Lemma 1: For the MIMO IC with i.i.d. Rayleigh fading, the DoF region with instantaneous CSIT and output 
feedback is given by 

j3iCSi&op ^ |(di,d2) < di < min(Mi,iVi), < d2 < min(M2,7V2) 

c?i + (i2 < niin [Ml + M2, iVi + iV2, max(Mi, A^2), max(M2, iVi)] }■ 

Moreover, D'^^^ = D''^^^'^°p. 

Proof: See Appendix |A] ■ 
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Definition 1: The region D^^^^j. is defined as 

oLer = {idi,d2)\ Loi = 0<di< mm{Mi,Ni), i = 1,2; 

^ A di d2 min(iV2,Mi + M2) 

i^l = '. — 7TT — TIT H '. — TIT S: 



min(Ari +iV2,Mi) min(iV2, Mi) ~ min(7V2,Mi) 
^ A di d2 ^ min(iVi,Mi + M2) 

^ ~ min(Afi, M2) ^ m.m.{Ni + N2, M2) ~ min(7Vi,M2) ' 

L3 = di +d2 < mill [Ml + M2,iVi + iV2,max(Mi,iV2),max(M2,iVi)]} . 

Note that the first two bounds on di and ^2 have been denoted as Lqi and L02 respectively; while the last three 
bounds on the weighted sums of di and d2 are denoted respectively by Li, L2, and L3. 

Theorem 1 (Outer-Bound with Shannon feedback): For the MIMO IC with i.i.d. Rayleigh fading, the DoF 
region with Shannon feedback is outer-bounded by the region D^^^^gj,, i.e., 

■L* !^ -L^outer- 



Proof: See Section III 



Definition 2: For a given i e {1,2}, Condition i is said to hold whenever the inequality 



M,>Ni + N2- Mj >Ni> N, > Mj > N, ^^ 

Ni — Mj 



holds for j G {1, 2} with j / i. 

Clearly, the two conditions are symmetric counterparts of each other. Moreover, the two conditions can not be true 
simultaneously. Condition i can not hold if Nj > Ni. 



Theorem 2 (The DoF Region with Shannon feedbacli): For the MIMO IC with i.i.d. Rayleigh fading, the 
DoF region with Shannon feedback is equal to the region D^^^^jj., i.e., 

— ■'-'outer- 

Proof: It is sufficient to prove that the region DQ^^^JJ. is achievable when there is Shannon feedback. We assume 
without loss of generality that A'^i > A'^2 (note, under this assumption, that Condition 2 can not hold). 



Suppose Condition 1 does not hold. Then from |20 Theorem 2], we observe that 



_ ■»dCSI 

'outer 

^dCSI 



outer — ^ 



SO that the theorem follows by noting that D'^'-'^^ ^ C D^uter- 

Thus, it is only required to show that the region D^^^^jj, is achievable when Condition 1 holds. The detailed proof 



is given in Section IV 



The basic idea behind the interference alignment (lA) based achievability scheme developed in Section |IV] to 
prove the above theorem is illustrated via an example in Section II-C which shows that 7^ j^dcsi ^j^^ provides 



insight as to why the DoF regions are not always identical. Further, Section II-D compares the techniques used to 
achieve lA under Shannon feedback and under delayed CSIT. 

Remark 1 (Comparison of and D'^^^^): Using Theorem [2] above and |20 Theorem 2], we observe that 
Y)S _^ £)dcsi Qj^jy Conditions 1 or 2 hold. In other words, in the presence of delayed CSIT, output feedback 

helps in improving the DoF region only when Conditions 1 or 2 hold. 

Remark 2: Using Lemma [T] we observe that output feedback can not enhance the DoF region when there is 
instantaneous CSIT. In contrast, output feedback improves the DoF region when there is just delayed CSIT. 

The next two corollaries extend the above results to MIMO ICs with limited Shannon feedback of Type I and 
Type II, just output feedback and with designable Shannon feedback. 
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Corollary 1: For the MIMO IC with i.i.d. Rayleigh fading, we have 



D 



isi 



D 



IS2 



D 



dS 



Proof: See Appendix B-A 



Corollary 2: For the MIMO IC with i.i.d. Rayleigh fading, we have 

D°P = 

if neither of the following two inequalities hold: min(Mi, Ni) > N2 > M2 and min(M2, A^2) > -^1 > -^i- 



Proof: See Appendix B-B 



Thus, the above corollary yields the DoF region with output feedback for a large class of MIMO ICs. When one 
of the above two conditions holds the DoF region with output feedback is not known. 

Following the submission of a conference version of this work, and simultaneously with its publication in |22|, 
Tandon et. al. reported the DoF region for limited Shannon feedback of Type II in \23\. 



C. Retrospective Interference Alignment with Shannon Feedback 

With Theorem [2] in hand, we know that D j^dcsi general. However, the proof of this theorem is involved, 
and therefore, we provide an example in which / j^dcsi j^y demonstrating that a point outside D'^*-'^^ can be 
achieved with Shannon feedback. The proof that this scheme works is based on a series of simple propositions. 

In particular, we consider the (6, 2, 4, 3) MIMO IC shown in Fig. [5] For this IC, the DoF regions with just 
delayed CSIT and with Shannon feedback are plotted in Fig. |2j from which we observe that the former is strictly 
smaller than the latter. Moreover, it is clear from Fig. [2] that when (i2 = 2, di < | with delayed CSIT. Here, we 
prove the achievability the DoF pair (^,2) with Shannon feedback, which establishes that / £)dcsi gjjj^e 

12 \ 5 
7^3- 

Toward this end, we show that by coding over 7 times slots, 12 and 14 DoF can be achieved for the first and 
the second transmit-receive pairs, respectively. In our scheme, T2 transmits 2 data symbols (DSs) intended for R2 
over each time slot and thus a total of 14 DSs are sent; whereas Tl transmits 6 DSs intended for Rl at t = 1 and 
t = 4. Further, at t = 7, we show that desired DSs can be successfully decoded by both receivers. 

Consider first a transformation which simplifies the description of our scheme. At time t, the i^^ receiver can 
compute a unitary matrix Ui2{t) such that it is deterministic function of Hi2{t) and the bottom {Ni — 2) rows of the 
transformed matrix Ui2{t)Hi2{t) consist only of zeros. Using it, the i^^ receiver evaluates the transformed output 
Ui2{t)Yi{t). Henceforth, the transformed quantities Ui2{t)Hi2{t) and Ui2{t)Yi{t) are denoted simply as Hi2{t) and 




It is assumed, without loss of generality, that only the first two 
rows of H^2 3nd H22 contain non-zero entries. 



Fig. 3. The (6, 2, 4, 3) MIMO IC Considered in Section 



II-C 
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Fig. 4. lA scheme for Achieving ( " , 2) with Shannon Feedback over the (6, 2, 4, 3) MIMO IC: t = 1 to t 



yj(t), respectively. Evidently, the transmit signal X2{t) affects only the first two entries of (the transformed) Yi{t). 
Hence, throughout this subsection, we assume without loss of generality that the bottom {Ni — 2) rows of Hi2{t) 
consist only of zeros for all t, and thus, the signal X2{t) can affect only the first two antennas of Rl and R2 (see 
also Fig. |3]l. 

Consider the operation of the scheme at t = 1; see also Fig. |4] At this time, Tl transmits i.i.d. complex Gaussian 
DSs {lijlf^i intended for Rl, while T2 sends i.i.d. complex Gaussian DSs vi and V2 for R2. Thus, the transmit 
signals are formed (for a vector Vi, Vij denotes its j*^ entry) as follows: Xij(l) = V « G [1 : 6] and X2j{l) = vj, 
j = 1,2. The received signals at Rl and R2 can be written as follows (note that since the additive noises do not 
alter a DoF result, they are ignored) with desired and interfering linear combinations of data symbols defined using 
the symbols LC and /, respectively, so that 

Yu{l) = Hui{l)Xi{l)+Hii2{l)X2{l), iG[l:4] 
= /7m(l)K "2 ••• + Hu2il) [vl v*^]* 

"V ^ V 

Y2^{l) = H2j2(.l)X2{l) + F2,l(l)Xi(l) j G [1 : 2] 

= H2,2il)[vl V*2]* + H2jlil) [ul Ul ■ 



— LC2j — l2j 

and moreover, since we have assumed without loss of generality that the bottom {N-i — 2) rows of Hi2{t) consist 
only of zeros V t and i, we have I13 = I14 = LC23 = (see also Fig. |4]). Thus, 



Yu{l) = LCu i G [3 

Y2j{l) = LC2,+l2i J G [1 

123(1) = l23. 



2], 
4], 

2], and 



At time t = 1, both receivers encounter interference, and therefore, can not decode their desired data symbols. 
Moreover, the interference at a given antenna of a receiver is the hnear combination of the DSs sent by its unpaired 



9 



transmitter. Thus, each transmitter can compute the past interference encountered by its unpaired receiver using 
just delayed CSI, as stated in the following proposition. 

Proposition 1: Tl, at time t = 2, can compute l2j V j G [1 : 3] using just delayed CSI. 

Proof: Tl knows DSs Uj's; and time t = 2, and it also knows H2ji{l) because of delayed CSIT knowledge 

and hence = -ff2ji(l) ^ "■■ ""e]*- ■ 

Hence, at t = 2 and f = 3, Tl transmits l2j V j G [1 : 3] as shown in Fig. [4j while T2 continues to transmit 2 
new DSs intended for R2. In particular, the transmit signals are given as 



Xui2) 


= Xu{3) 


= V i G [1 : 3], 




Xui2) 


= h3, 


Xi5(2) = I22, 


Xw{2) 


Xui3) 


= -^23 1 


^15(3) = Xi6(3) 


= 


^21 (2) 


= ^3, 


^22(2) = Vi 




^21 (3) 


= ^^5, 


^22(3) = Vq, 





where V3, ■ ■ ■ , vq are i.i.d. complex Gaussian DSs intended for R2. Consider now the decoding operation at the 
receivers, starting with R2. The following proposition states that R2 can decode the desired DSs vi, ■ ■ ■ , vq at time 
t = 3. 

Proposition 2: At time t = 2, R2 can decode v^, v^, and /22- At time t = 3, R2 can decode v^, vq, and /2i- 
After determining /22 and I21, R2 can evaluate vi and V2, and thus, at t = 3, decoding is successful at R2. 

Proof: At time t = 1, R2 knows 123. Hence, it can subtract the contribution due to /23 from 12(2). Thus, 
equivalently, for R2, only 3 transmit antennas sending a non-zero signal at this time. Therefore, R2, via simple 
channel inversion, can determine v-^, v^, and /22- That at time t = 3, R2 can decode ^5, vq, and /21 follows 
similarly. After knowing the values of /21 and /22, R2 can evaluate 5^2i(l) — hi = LC21, and similarly, LC22- In 
other words, at t = 3, it can obtain 2 linear combinations (LCs) of vi and V2. Thus, it can decode vi and V2- ■ 

Consider now the case of Rl. At t = 3, as per the next proposition, Rl knows I21, I22, and /23- Since these are 
linear combinations of ui, ■ ■ ■ , uq, they are useful for Rl. 

Proposition 3: Rl can determine the values of /22 and I23 at time t = 2, and that of I21 at time t = 3. 

Proof: Rl can simply ignore the first two receive antennas, which experience interference due to the signal of 
T2. Then, using the last two antennas, it can compute the required symbols using channel inversion. ■ 

Thus, at t = 3, Rl gets 5 linear combinations, namely, LC23, LC24, hi, h2, and hs, of 6 desired DSs. Thus, 
it needs one more useful linear combination for successful decoding. Consider the next proposition. 

Proposition 4: Given that Rl knows the values of LC23, LC24, hi, h2, and /23> it can decode the six symbols 
ui, U2, • • • , Uq, provided it knows /12. 

Proof: If Rl knows LC23, LC24, hi, h2, and 723> then it is sufficient for it to know one more linear combination 
of ui, U2, ■ ■ ■ , U6, which it can compute using /12 as follows: ^12(1) — h2 = LC12 is a linear combination of Rl's 
desired symbols. ■ 

Hence, it is sufficient to communicate the value of I12 to Rl. Consider the following proposition. 

Proposition 5: Tl can compute I12 at t = 2 using Shannon feedback, but not using just delayed CSI. Moreover, 
R2 knows /12 at time t = 3. 

Proof: Because of Shannon feedback, Tl knows ¥2(1), H2i{l), and i/22(l) at time t = 2 by virtue of Shannon 
feedback. Since it knows -'^i(l) by default, it can compute 

(i722(l))^{l2(l)-i^2l(l)Xi(l)} = (F22(1))V22(1)X2(1) =X2(1) 

at time t = 2, where fH22{^)) is the pseudo-inverse of i/22(l) 1^2^- Subsequently, it can evaluate I12, which is 



a linear combination of vi and V2. R2, after decoding vi and V2 at time t = 3, can compute /12 since it knows all 
channel matrices. ■ 
Thus, in light of this proposition, Tl can convey /12 to Rl at time t = 6 without interfering with decoding at R2. 

Consider the next block of 3 time slots, i.e., for t = 4, 5, 6. The scheme here is identical to that for the first three 
time slots; see Fig. [s] At time t = 4, Tl transmits DSs u[, u'2, ■ ■ ■ , u'q intended for Rl. T2, on the other hand, 
transmits DSs v'^, v'2, ■ ■ ■ , v'q for R2. Here, the superscript prime is used to indicate the quantities that are specific 
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t = 6 t = 5 t = 4 




I' I' I' 

'23 '22 '21 



• Rl can decode its desired symbols if it knows I'i2- 

• R2, after decoding v\ and v'2, can compute I\2- 

• Tl knows v\ and v'2 at t = 5, and hence also I'i2- 

Fig. 5. lA scheme for Achieving ( " , 2) with Shannon Feedback over the (6, 2, 4, 3) MIMO IC: t = 4 to t = 6 




O outperform delayed CSIT. 



Fig. 6. lA scheme for Achieving (^,2) with Shannon Feedback over the (6, 2, 4, 3) MIMO IC: t = 7 



to this block of three time slots. The only change in the transmission scheme is that at time t = 6, Tl transmits 
Ii2- Consider the following propositions which will be used to describe decoding at the receivers. 

Proposition 6: Consider receiver R2: (a) at time t = 5, R2 can decode ^3, v'^, and (b) at time t = 6, R2 
can decode t;^, v'^, and l!^^ and (c) after determining ^^id ^23' evaluate v[ and and thus, at t = 6, 

R2 can perform successful decoding. 

Proof: The proofs of Parts (a) and (c) similar to those of Parts (i) and (iii) of Proposition [2] Further, Part (b) 
follows from the proof of Proposition [2jii) by noting that I12 is known to Rl at time t = 3. ■ 

Proposition 7: Consider receiver Rl: (a) Rl knows I12 at time t = 6, from which it can decode ui, U2, • • • , 
uq, (b) Rl can compute I22 and at time t = 5, and I21 at time t = 6 (c) if Rl is conveyed the value of 1(2 at 
time t = 7, it can decode and (d) Tl knows /(g at time t = 5. 

Proof: The four parts follow respectively from Propositions |4| [3} |4j and |5] ■ 
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t =3 t =2 t = 1 








hi 






hi 

IT 

V3 



174 



U2 




tnused ! 




unused ! 



Fig. 7. lA scheme for Achieving (|, 2) with Delayed CSIT over the (6, 2, 4, 3) MIMO IC 



Thus, as per the two propositions, at i = 6, R2 can decode all symbols sent to it until that time, whereas Rl can 
do the same if it is delivered the value of /{g at time t = 7. Next, consider t = 7. As shown in Fig. [6| Tl transmits 
just I[2, while T2 sends two new data symbols and It is easy to show that Rl can decode /(g' whereas R2 
can decode v[ and Hence, as desired, we can achieve a DoF pair (12, 14) over 7 symbol times. 

It is instructive to compare the above Shannon feedback scheme with the delayed-CSIT coding scheme of pOj 
that can only achieve the pair (1,2) over the (6, 2, 4, 3) MIMO IC as illustrated in Fig. [v] in terms of the notation 
introduced earlier in this sub-section. In this latter case, by coding over 3 time slots, we achieve 5 and 6 DoF for 
the two transmit-receive pairs, respectively. Note that Tl can not determine I12 (and /(g) with just delayed CSIT, 
and thus only 5 linear combinations can be delivered to Rl over a span of 3 time slots. Hence, Tl transmits only 
5 DSs for Rl at time t = 1, which Rl can decode at t = 3. Except for this difference (compare Figs. |4]and|7]), 
this coding scheme is identical to the Shannon feedback coding scheme. 

Note that under this delayed-CSIT scheme, the sixth antenna of Rl is never used. Moreover, one of the last 
two antennas of Rl, say, the fourth, remains unused, although it never experiences interference. In other words, in 
the delayed-CSIT scheme, some of the resources are not utilized. Shannon feedback on the other hand permits the 
exploitation of these resources - the sixth antenna of Tl is used with Shannon feedback and both interference-free 
antennas of Rl are used under the Shannon-feedback scheme at t = 4, 5, 6 - thereby outperforming delayed CSIT 
feedback. 



D. Comparison of lA with Shannon Feedback and lA with Delayed CSIT 



In the Shannon-feedback coding scheme of the previous sub-section (and more generally of Section IV 1, one 
may observe that the following two types of techniques are used to achieve lA: 

1) Since the interference at a given receiver is a linear combinations of the DSs sent by its unpaired transmitter, 
each transmitter, using delayed CSIT, can evaluate and then transmit the interference seen in the past by its 
unpaired receiver to convey new useful linear combinations to its paired receiver without creating any new 
additional interference to the unpaired receiver. 

2) Equipped with the knowledge of past channel outputs and past channel matrices, each transmitter can compute 
and transmit the interference encountered in the past by its paired receiver to provide an opportunity to its 
paired receiver to cancel the past interference while conveying useful information to its unpaired receiver. 

Note that the DoF-region-optimal lA-based achievability schemes developed in 1 20 1 for the MIMO IC with just 
delayed CSIT make use of the first technique but not the second one, because the latter is feasible only in the 
presence of output feedback. Output feedback with delayed CSIT on the other hand enables each transmitter to 
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Tl Rl 




Fig. 8. Illustration of notation. 

compute the past transmit signal of the other transmitter which introduces partial transmitter cooperation which is 
infeasible when there is just delayed CSIT; remarkably, this transmitter cooperation reveals all available signaling 
dimensions and achieves the DoF gains promised by Shannon feedback. 

Note that while transmitter cooperation is induced by output feedback regardless of whether there is instantaneous 
CSIT or delayed CSIT, it is only in the case of delayed CSIT that such cooperation provides a DoF-region improve- 
ment. With instantaneous CSIT, the transmit signals can be suitably beamformed to cause minimal interference at 
the receivers. With delayed CSIT however, transmit beamforming can not be employed and hence the receivers 
experience a relatively high amount of interference so that output feedback is more effective. 

III. Proof OF Theorem [1] 

If {di,d2) G D^, then (^1,^2) G D'*^^^'^°p. Therefore, by Lemma [T] bounds Lqi, Lq2, and L3 must hold at any 
{di,d2) G D^. Now, note that Li and L2 are symmetric counterparts of each other (i.e., any one of them can be 
obtained from the other by changing the user ordering). Hence, it is sufficient to prove that Li holds, which is the 
goal of the remainder of this section. 

Before we prove that Li is an outer-bound, we introduce some further notation used henceforth in the paper. 

Notation: The set of four channel matrices at time t is denoted by H{t), i.e., H{t) = where i, j G {1, 2}. 

For integers ni and n2, if ni < n2, [rii : 712] = {ni, ni + !,••• , 722}; whereas if ni > n2, then [ni : 712] denotes the 
empty set. For a random variable X{t), X{[ni : n2]) = {X{t)}^l^ if ni < n2, whereas X{[ni : 722]) denotes an 
empty set if ni > 122- Further, for n> 1, X{n) = X{[1 : n]). For the received signal Yi{t) and the channel matrix 
Hik{t), the j^^ entry and the j^^ row are denoted respectively by Yij{t) and Hijk{t). Further, whenever ni < 722 
and ns < n^, yi[n,:n^](S) = {^ii(i)}"=„,, ^i[ni:n.](N : n^) = {{^ii(0}"=njr=n3' Hi[n,:n,]jit) is the channel 
matrix from j^^ transmitter to channel outputs i^i[ni:n2] (*) (^^^ Fig- however, if ni > n2 and/or na > n^, then 
Yi[n,:n,]it) and yi[„,:„,]([n3 : 714]) denote empty sets. Moreover, for n > 1, = Yi[^^,^^]{[l : n]). Finally, 

o(log2 -P) denotes any real-valued function x{P) of P such that limp_>oo i^^^ = 0- 

We will show that the bound Li must hold at any {di, ^2) G D^. We first apply Fano's inequality to upper-bound 
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the rates achievable for the two users starting below with R2. 



nR2 < l{M2;Y2{n) 



Hin) + ne„ 



hlYoin] 



H(n\ 



h[Y2{n) 



M2,H(n)] +ne 



ri 

h(Y2{n)\H{n)^ - ^h(Y2{t)\Y2{t - l),M2,Hin)) + 

t=i 



ner, 



< h(Y2{n) Hin)) -^h(Y2{t) Y2{t - 1), M2, X2{t), H{n)j + nen, 

t=i 

n 

= h(Y2{n) Hin)) - ^h(Y2{t) F2(t - 1), A^2, ^2 ^(t)) + ne„, 



(3) 
(4) 

(5) 
(6) 
(V) 



t=i 



where e„ ^ as n — cxd; the inequality Q holds since conditioning reduces entropy fV\; and the equality in Q 
follows on noting that random variables |l2(i);^2(i — 1), -^2, -^2(i)} are independent of Hi[t + 1 : n]). 

We next use Fano's inequality at Rl assuming that it knows the received signal i2(t) instantaneously and also 
the message A42 to obtain the following: 



(8) 



nRi < I[^Mi;Y2in),Yi{n),M2 Hin)j +nen 
= l(^Mi;Yiin),Y2{n)\M2,Hin)) +nen 

n 

= ^h(Yiit),Y2it)\Yiit - l),Y2it - l),M2,Hin)) 
t=i 

n 

-Y,h{Y,it),Y2it) 



t=i 



Yiit-l),Y2{t-l),M2,Mi,Hin)j+nen 
^ /i (t) , y2 (t) I Fi (t - 1 ) , F2 (t - 1) , 7W 2 , X2 (t ) , 

n 

-^h(Yiit),Y2it)\Yiit - l),Y2{t - l),M2,Mi,Xiit),X2it), Hin)) + ne„ 

n 

^h(Yiit),Y2it) Yiit - l),F2(t - l),M2,X2it),Hit)' 



(9) 



(10) 



t=i 



-Y,h(Yiit),Y2it) 



t=i 



Yiit-l),Y2it- l),M2,Mi,Xiit),X2it),Hit) ) + ne„ 

^h(Yiit),Y2it)Yiit-l),Y2it-l),M2,X2it), Hit) ] 
1 

n 

Y,h{Wiit),W2it))+nen 
t=i 

{h(Yiit),Y2it)\Yiit - l),F2(t - l),M2,X2it)Hit)) + o(log2 P) + en] , 



(11) 



(12) 



(13) 



t=i 



where various steps follow because of the following reasons: the equality in ([8]) holds due to the independence of 
the two messages; equality Q holds because of the definition of the mutual information and the chain rule for 
the differential entropy; equality ( [To|) f ollows by noting that the transmit signal Xiit) is a deterministic function 
of M-i, Yiit — 1), Y2it — 1), and Hit); (111 holds since all the involved random variables are independent of 
Hi[t + 1 : n]); (12 1 holds because translation does not change differential entropy, and Wiit) and 1^2 (*) are 
independent of Yiit — 1), Y2it — 1), Ai2, -Mi, Xiit), X2it), and Hit); the final equality holds since the noises 
are i.i.d. across time and their statistics are independent of P. 
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Lemma 2: Let mi = min(Mi, A^i + A''2) and m2 = min(Mi, A''2). Then, for each t G [1 : n], we have 



m2 



h{Y2{t) Y2it-l),M2,X2it),H{t) 



> —h{Yi{t),Y2{t) Y,{t-l),Y2{t-l),M2,X2{t),H{t)) +o(log2P) 
mi V / 



where the term o(log2 P) is constant with n. 
Proof: See Section [IIFaI 
Combining the bounds in Q, ( [T3] ), and the one in Lemma [2j we get 



— ii2 < -^—h(Y2in) 

7712 "1-2 ■ 'l- 



H(n)] + 



1712 



R2 Ri ^ min(iV2,Mi + M2) , 

\ < ^ ■ log2 P + e„ 

7712 mi 7TT-2 



1 

mi 
1 

m2 



Ri - o(log2 P) - en 

+ 1 1 +0(l0g2P), 



where the last inequaUty holds since the DoF of the point-to-point MIMO channel are equal to the minimum of 
the number of transmit and receive antennas. Since e„ — as — oo, we now have 

R2 ^Ri , min(A^2,Mi + M2) 

+ < log2 P + o(log2 P) 

m2 mi m2 

_^ d2 di i?2 , Ri . min(A^2,Mi +M2) 

=^ \ < lim sup 1 < 

m2 mi p-s>oo "1-2 mi m2 

as desired. 



A. Proof of Lemma |2] 

In the following two lemmas, it is shown that although the received signals Yi{t) and Y2{t) are A^i and N2 
dimensional, respectively, only the first mi — m2 and m2 entries of them are relevant as far as the current DoF 
analysis is concerned. 

Lemma 3: If m2 = min(Mi, A'^2)> we have the following: 



h[Y2it) Y2{t-l),M2,X2it),H{t)j > h(Y2[i.,rn4t) Y 2{t - 1) , M2, X 2it) , H (t) j +0(l0g2P), 

where the term o(log2 P) is constant with n. 

Proof: Follows from the techniques in \20. Proof of Lemma 2]. 
Lemma 4: If mi = min(Mi, A''i + A^2), then 



h[Yiit),Y2it)Yi{t-l),Y2it-l),M2,X2it),H{t)^ 

</i(n[l;„^,-„^.](^),%™.](^) Fi (t - 1) , F2 - 1) , 2 , ^2 (t) , ) + o(log2 P) • 



where the term o(log2 P) is constant with n. 

Proof: Follows from the techniques in |20. Proof of Lemma 3]. ■ 
If mi — m2 = 0, Lemma [2] holds trivially. Hence, in the following, we may consider without loss of generality 
that mi > m2. 

We now prove the following lemma which is critical in the proof of Lemma [2] 

Lemma 5: Let Q{t) = {M2,H{t),Y2{t - l),X2{t)}. For an i G [1 : m2 - 1] and a /c G [1 : mi — m2], if 
j = i + 1 and I = k + I, we have the following equalities: 

Q(i),^2[l:*-l]W); 
h (Yu (t) I Q(t) , y2[l:m.] (t) , Yili:k-l] [t] 

Proof: It is sufficient to prove the first equality. Define Y^{t) = Y2{t) - H22{t)X2{t) = H2i{t)Xi{t) + W2{t). 



h[Y2i{t) g(t),l2[l;^-l](i) 
h(Y2mAt)\Q{t),Y2li.,„,,^l]{t) 

h(Yik{t) Q{t),Y2ii,m,]{t),Yiii.,k-i]{t) 



h{Y2,it) 

h(Yii{t) 
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Toward this end, we have the following sequence of equalities, 



h(Y2i{t) 

E, 

E 



QW,i^2[l:i-l](t) 



(14) 



M2,H{t - l),Y2{t - l),X2{t),Y^[,,,_^{t), H2[i..,]i{t),Hu{t),m2{t),H22it), //2[m:7V.]l W ) (15) 

M2,H{t-l),Y2it-l),X2{t),Y^^,,,_^{t),H2ii..,]i{t) 



H...{t)=a h[Yii{t) M2,H(t-l),Y2{t-l),X2{t),Y^[,.,,_^{t),H2[l..,-l]l{t),H2^l{t) 



H2ji(t)=a 



h(Y^j{t) A^2,i?(t-l),F2(t-l),X2(t),y2Vl](0,^2[l;i-l]l(i),^2il(t) 
(ya',- (t) I A^2 , - 1) , F2 (t - 1) , X2 (t) , y2V.-l] W ' ^2[1;»-1] 1 (t) , ^2jl W 

h(Y2j{t) M2,H{t-l),Y2it-l),X2{t),Y2[,.,i^^{t),H{t)), 



(16) 
(17) 
(18) 
(19) 
(20) 

where the various equalities hold as follows: ^14\ holds by the definition of Q{t); ( jlsj ) holds because translation 
does not change differential entropy ( |16| ) follows by noting that //i2(t), -^22(^)5 ^^2[i+i:Af2]i(0} 

independent of {Y2^{t),M.2,H{t — 1),Y2(T^ 1) , X 2{t) ,Y^^^,._^{t) , H2[i:i]i{t)] (note the present channel matrices 
are independent of the present and the past channel inputs and noises); ( [TT] ) holds by the definition of the conditional 
differential entropy; ( 18 1 holds because conditioned on {A42, H {t — 1) ,Y 2{t — 1) , X 2{t) ,Y^^-^,._^{t) , -f^2[i:i-i]i(0}' 
the joint distribution of {H2ii{t) , Xi{t) ,W2iit)} is identical to that of {H2ji{t), Xi{t),W2j{t)}; (fToll holds by the 
definition of the conditional differential entropy; (20l holds since |i/2ii(t), ^^2[i+2:Af2]i(^)' ^12(*)) ^22(i)} 
are independent of {Y2j{t),M2, H{t — l),Y2{t — l), X2{t),Y2^-^,._-^^^{t), -f^2[i:j-i]i(*)i H2ji{t)] and since translation 
does not change differential entropy. ■ 

Note that the first equality in the above lemma asserts that the signals Y2i{t) and Y2j{t) received at the i^^ and 
j^^ antenna, respectively, of R2 have equal differential entropy, when conditioned on the channel matrices H{t), 
the message A42 and the transmit signal X2{t) of T2, the past channel outputs Y2{t — 1), and the present channel 
outputs l2[i:i-i](*) some other receive antennas. We refer to this property as the statistical equivalence of the 
channel outputs, which essentially says that given the past and present channel outputs, the signals received at any 
two antennas of the system provide an equal amount of information about Aii. Note that this property of statistical 
equivalence of the channel outputs was shown to hold in |20| for the case of delayed CSIT. Here, on the other 
hand, the same property is shown to be true under the stronger setting of Shannon feedback. 

The above lemma yields the following simple corollary, where Q{t) = {A42, H{t),Y2{t — 1), X2(t)} as before. 

Corollary 3: For an i G [1 : 7712 — 1] and a A; G [1 : mi — 7712], if j = « + 1 and / = A; + 1, we have the following: 

> 



h[Y2^{t) Q(i),y2[l:.-l]W 



h[Y2,{t) 



h{Y2rnAt)\Q{t).y2[l:m,^l]{t)) > h(Yi,{t) 



Q(t),y2[l;i](t)j; 
Q{t),Y2li,rr.,]{t)); 



h{Yik{t) Q(t),y2[l;m.](i),^l[l:fc-l](i)j > h[Yil{t) Q{t),Y2li,m,]{t),Y^^:k]{t)). 

Proof: Follows from the previous lemma by invoking the fact that conditioning reduces entropy |T|. 
Lemma 6: We have 



{t)\Q{t)] >m2-h[Yi 

[l:mi— 012] 

{t),Y2ii.,m,mQ{t),Yi{t-l) 
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Proof: By the previous corollary and the chain rule for the differential entropy, we get 



1 



7712 



-Y^h{Y2i{t) g(t),%,_i](t) 



7712 



•t=l 



[l:m2] {t)m) 

> h(Y2mAt)\Q{t),Y2li..m,-l]{t) 

>/i(yn(t)|Q(t),y2[i;™.]W) 

> ^(^l[l:mi-m2](0 , ^2[l:m2] • 

mi — 772-2 ^ ^ J / 

This yields 

(7771 - 7772) " h(Y2[i:m2]{t) Q{tj^ > 7772 " ^ ('5^1[l:mi -ma] (0 

> 7772 • /7(^Y'i[i:m^„m,](t) 

since conditioning reduces entropy. Similarly, we can obtain 

m2 ■ h(Y2ii..„,,]{t) Q{t)) > m2 ■ h(Y2ii.,^,]{t) Q{t),Yi{t - I)) . 



Q{t),Y2li,^,]{t) 

Q{t),Yi{t-l),Y2[i.,^,]{t) 



(21) 



(22) 



The lemma can now be obtained by adding the inequalities in ( [21] ) and ([22]). ■ 
The inequality in Lemma [2] can now be derived by combining the results of Lemmas [3] |4] and [6] and by noting 
that the sum or the difference of two o(log2 P) terms yields another o(log2 P) term. 



IV. Proof OF Theorem [2] 

As mentioned before, it is sufficient to prove that the outer-bound Dq^^^j. is achievable when Condition 1 holds. 
Throughout this section, it is assumed that Condition 1 holds. 

Here, bound L2 can be easily shown to be redundant (it is implied by L3), and thus can be ignored. Further, in 
the present case, bounds Li and L3 are given by 

^^ = JIT + ^<^ and L3 = di+d2<Ni, 

I\2 

where M[ = min(Mi, iVi + N2). 

The typical shape of the outer-bound is shown in Fig. [9] where Po2,i is the point of intersection of the line 
d2 = M2 and the one corresponding to bound Li, similarly Po2,3, and Pi^a is the point of intersection of lines 
corresponding to bounds Li and L3. Moreover, 



P 



02,1 



,N2-M2 

M[^j-^,M2 



Pi, 



No 



No 



, and Po2,3 = (iVi -M2,M2). 



Depending on whether the (i2 -coordinate of Pi 3 is less than M2 or not, we have to consider two cases separately. 
.CaseA:M(>iV2^5A| 



Here, bound Li is redundant. Moreover, from Fig. m one may observe that if Po2,3 £ then D^^^^ 
Hence, we find here sufficient to prove that Po2,3 £ D ■ 
. CaseB:M{<iV2fe5^ : 

Here, bounds Li and L3 are both active. From Fig.joj we observe the sufficiency of proving that Po2,i, Pi,3 £ D^. 
Next, we propose a generic retrospective interference alignment scheme, which is used later to prove that Po2,3 £ 
under Case A and Po2,i,Pi,3 G under Case B with an appropriate choice of parameters. This scheme is 
specified in terms of the parameters 

T, ti, t2, and {"72(i)|._^, (23) 

where T,ti,t2 S N, m\{i),m2{i) G NU {0} V i, and Design Criteria 1-5, which are stated in the sequel. It 
is developed such that if, for a given a DoF pair P = (^1,^2) and the given (Mi, M2, A^i, iV2) MIMO IC, the 
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d2-coordinate of P^^^ is greater than zero. 

Outer-bound 

if d2-coordinate of Pi 3 < M2, 

i.e., Case B 

if d2-coordinate of Pi 3 > M2, 

i.e., Case A 



Fig. 9. Two Possible Sliapes of the Outer-Bound wlien Condition 1 Holds 



parameters in (23 1 can be chosen as functions of (^1,^2) and (Mi, M2, A^i, so that Design Criteria 1-5 are 
satisfied, then the DoF pair P = (di, ^2) G of the given (Mi, M2, iVi, N2) MIMO IC. 

Consider now the retrospective interference alignment scheme. The goal is to prove that a given DoF pair 
P = {di, ^2) £ D^. Let us first state two important design criteria. 

Design Criterion 1: Choose positive integers ti and t2 such that ti + ^2 = T. 

Design Criterion 2: Choose a positive integer T such that Tdi and Td2 axe integers. 

Now, choose a B £ N and set 



T* = {B + 1) ■ T, di =B-T-di, where i G {1, 2}, and = 



d\ di\ I d^ d2 



It will be proved that for any positive integer B, by coding over T* time slots, we can simultaneously achieve d* 
and (ig DoF for the two users respectively. This implies that P G D^, since the DoF region is closed, and the point 
P* converges to P as i3 — 00. Thus, our aim in the following is to prove the achievability of point P*. 

The entire duration of T* is divided into B + 1 blocks, each consisting of T time slots. Each block is further 
divided into two phases with Phase One having ti time slots and Phase Two the remainder of t2 = ^ — ti time 
slots. 

Definition 3: Define two functions b{t), the index of the block to which time slot t belongs, and t{t), the index 
of that time slot within Block b{t), as 



m 



t 



and t{t) = t-T ■ {b{t) -l). 



Note that t{t) G [1 : T]. Thus, each time slot t can be uniquely identified by the pair (6(t),t(t)). Block b, 

6 G [1 : -B + 1], consists of time slots t G [b'T + 1 ib'T + T], where 6' = 6 - 1; we let Phase One of block b consist 
of time slots t G [b'T + 1 : b'T + ti] and Phase Two the remaining time slots t G [b'T + ti + I : b'T + ti + ^2]- 
The general structure of our achievability scheme has the following features: 

• In each of the first B blocks, TI and T2 respectively transmit Tdi and Td2 i.i.d. complex Gaussian data 
symbols (DSs) intended for Rl and R2, respectively. In Block P + 1, no new DS is sent. 

• In each time slot, T2 transmits an appropriate number of new DSs intended for R2, and therefore, in some 



sense, it does not play an active role in aligning interference (as in the example of Section II-C I 
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Tl, on the other hand, transmits all Tdi DSs, to be transmitted over a given block during Phase One of that 
block (time slots t = 1 or t = 4 constitute Phase One in the example of Section II-C I. It signals over Phase 
Two such that at the end of each block, (a) R2 can decode all DSs sent to it over that block, and (b) Rl can 
decode all DSs sent to it over the previous block (note that t = 2, 3 or t = 4, 5 are counterparts of Phase Two 



in the example of Section II-C I. To meet these objectives optimally, Tl needs to align interference at both 
receivers. 

• Finally, the goal of Block i3 + 1 is to allow Rl to decode all DSs of Block B by not sending any new data 
(Block i? + 1, in some sense, corresponds to time slot t = 7 in the example of Section II-C I. 

Let Tl and T2 transmit mi{i) and m2{i) DSs intended for their respective receivers at the i^'^ time slot of any 
given block (except, the last block) where these design parameters are chosen according to following criterion. 

Design Criterion 3: Choose non-negative integers mi{i) and m2(i), i G [1 : T] , as follows. 

. mi(i) = V i G [ii + 1 : T] (recall, ti + ^2 = T); 

. mi(i) < M( V i G [1 : h] and YI=i = E*Li mi{i) = Tdi, 

. m2{j) < Ms V j G [1 : T] and ^J^^ maO') = Tds. 
At time t G [1 : BT], Tl transmits mi(t(t)) complex Gaussian DSs, denoted by whereas T2 

transmits m2(t(t)) complex Gaussian DSs, denoted by U2i{h{t) ,t{t)) . Note that all DSs 

{{uu{b,i)}[ 



=1 it=l 



and \{{u2jib,t)r:f 

are i.i.d. Note that since mi{i) = V « G [ti + 1 : ti + 12], no new DS is transmitted by Tl to Rl over Phase Two 
of any block so as to enable interference alignment and to ensure successful decoding. 

The transmission scheme of Tl and T2 are described next. Focusing on Block b, where 6 G [1 : the operation 
of Tl and T2 is described over the two phases separately. We start below with Phase One of Block b. See also 
Tables I pII where the main points about the operation of this scheme are summarized. 

Block b, Phase One: Here, t G [b'T + 1 : b'T + h] with b e [1 : B] and b' = b - 1. This phase is a data 
transmission phase. 

At time t G [b'T + 1 : b'T + ti] with 6 G [1 : B], Tl and T2 respectively transmit mi(t{t)) and m2{t{t)) DSs 
as follows: 



Xii{t) 
Xii{t) 

X2j{t) 
X2,{t) 



uii[b{t),t{t)), Vi G 1 : mi[t{t) 



0, Wi G 



mi[t{t)) +1 : Ml 



; and 



U2j{b{t),t{t)], Vj G 1 : m2[t{t) 



0, Vi G 



and 



, and 



7712 +1 : M2 



Consider now the signals received by Rl and R2 during Phase One. Since we are interested in the achievability 
of the DoF, we ignore throughout the presence of additive noise since it can not affect a DoF result. Then for a 
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t G [b'T +l:b'T + h] with b € [1 : B], we have 

Yu{t) = Hui{t)Xi{t) + Hu2{t)X2{t) ■■■ i G [1 : iVi] 



Hui{t) 



uii (b{t),t{t) 
ui2 ibit),t{t) 



Oi 



Afi-mi(t(t))]xl 



+ Hu2{t) 



U21 (bit),tit) 

U22 m)~t{t) 



0| 



M2-m2{t(t))]xl 



F2,i(t)Xi(t)+i/2,2W^2(i) 

uu {b{t),t{t)) 
ui2 {bit),t{t)) 



LCfi(b(i),I{i)) 



N2] 



H2jl{t) 



Oi 



Mi-mi(t(t))]xl 



+ H2j2{t) 



U21 {b{t),t{t)) 

U22 {bit),tit)) 



0| 



M2-m2{t{t))]xl 



LcW(fe(t),i(t)) 



LCg(6(i),t(i)) 



Here, LC[j (6, t) represents the Unear combination of DSs sent by the k^'^ transmitter at time t = {b — 1)T + t, 

and it affects the signal received by the i^^ receiver at its j^'^ antenna. Note here that ^C^j (b{t),t{t)^ is a linear 
combination of DSs intended only for Rl, while it causes interference to R2. Note the collection 



is referred to henceforth as the interference seen by R2 during Phase One of Block b. Evidently, interference at R2 
is useful for Rl. This completes the description of the operation over Phase One. 

Consider next the second phase of Block b € [1 : B]. Here, T2 continues to transmit data. Tl, on the other hand. 



employs two interference alignment techniques (described in Section II-D) over Phase Two. In the first one, Tl 
transmits a part of the interference seen by R2 over Phase One of the previous block (this idea is not used during 
Block 1), and under the second technique, it transmits all DSs of T2 sent earlier over Phase One of the same block. 
Now, as shown shortly, if over each time slot of Phase Two Tl transmits an appropriate numbers of DSs of T2 
and the interfering linear combinations at R2 then at the end of each block R2 can decode the desired DSs sent 
over the same block, while Rl can decode those sent over the previous block. 



Lemma 7: At time t = b'T + h + 1, Tl can obtain (a) DSs <^ U2 



t G 



b'T+1 : b'T+ti 



b{t)rt{t))] 



V i G 



1 : m2(t{t 



and 



, and (b) linear combinations (b{t),t{t)'j j, V j G [1 : N2] and t G \b'T+l : b'T+h 

Proof: Consider symbols in (a). By virtue of Shannon feedback, Tl knows all past channel matrices H{t), the 
received signal Y2{t), as well as its own transmit signal. Hence, for each t G [b'T + 1 : b'T + ti], it can compute 

Y2{t) - H2l{t)Xi{t) = H22{t)X2{t). 

Since N2 > M2 under Condition 1, and the channel matrices are Rayleigh faded, H22{t) is almost surely a one-to- 
one map. Hence, based on the value of H22{t)X2{t), Tl can determine X2{t). In other words, at t = b'T + ti + 1, 
Tl can perfectly evaluate data symbols 

That Tl can obtain the symbols in part (b) of the lemma can be shown analogously. ■ 

Next, consider the transmission strategy of Tl over Phase Two. We need to construct two sets {'Plc(^; ^) 
and {"Pds (ft) ^) for each 6 G [1 : i? + 1]. The set Vhc{b,'i) contains the linear combinations that interfere with 
R2 over Phase One of Block 6 — 1 (except, for 6 = 1, for which this set is empty) and that are to be sent by Tl 
over the i*'^ time slot of Phase Two of Block b. Moreover, the set VBsib, i) contains DSs that are sent by T2 over 
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Phase One, Block 1: t is such that b{t) = 1 and t{t) € [1 : h]. 



node 


operation at time t 


Tl 


transmits DSs uu{l,t{t)), i € [1 : mi{t{t))]. 


T2 


transmits DSs M2j(l,i(t)), j € [1 : m2{t{t))]. 


Rl 


receives Yu{t) = hd^^ (1, t{t)) +LCf}{l, t{t)), i e [1 : iVi]. 


R2 


receives Y2j{t) = Ld^j{l,t{t)) +Ld^j{l,t{t)), j € [1 : N^]. 



Phase Two, Block 1: t is such that b{t) = 1 and t{t) € [1 : ^2]. 



node 


operation 


Tl 


retransmits DSs {{M2i(l,t)}^^/*^|*^ sent by T2 over Phase One. 
At any time at most 7V2 — M2 antennas are active. 


T2 


transmits U2j{l,t{t)), j € [1 : m2{t{t))]. 


Rl 


Gets linear combinations ||LC^^j'(l,t)| | 


R2 


Can decode all DSs sent over this block. 
Knows the values of linear combinations ||lC2j'(1,J)| | 



TABLE I 

Block 1 of the retrospective interference alignment scheme with Shannon feedback 



Phase One of Block b and that are to be retransmitted by Tl over the i*^ time slot of Phase Two of Block b. In 
terms of these sets the transmissions of Tl and T^ are then described.^ 

With an aim of constructing sets |^Lc(^)^)|. and |'Pds(^5^)|. for each b € [1 : B + 1], consider the 

following. Let (c — d)~^ = max{0, c — d}, \S\ denote the cardinality of set S, 5lc(1) = 4> (the empty set), and 
Sds{B + 1) = 4>. Further, define 

riveqii) = (rni{t) - Ni^ , where i G [1 : h] 
for a 6 G [2 : S + 1], <Slc(6) = | {lC™ [b - l,t) }"^"^*^ | , and 

forabG [1:5],5ds(&) = |{u2^(6,^)}^^^ | 

Here, the set Si,c{b), b>2, contains linear combinations that interfere with R2 over Block 5—1 and the elements 
of this set are chosen such that if all hnear combinations in this set are delivered to Rl then Rl can decode all 
desired DSs sent over Block 6—1. Moreover, sets {VLc{b,i)}^^_i are constructed by partitioning the set 5lc(^)- 
Further, the set 5ds(^)» b < B, contains all DSs sent by T2 over Phase One of Block b, and by partitioning this 

set, smaller sets |Pds(^)^)|. are formed. 
Note that 

|5lc(^)| = ^n,:ecSi) and ST,s{b)\ = ^m2(i). 
t=i t=i 
Consider next two more design criteria which ensure that the cardinaUties of these sets are appropriately bounded. 
Design Criterion 4: Choose t\, t2, and m2(t), where t G [1 : t\\, such that 

tl 

Sm{b)\ =^m2{t) < {N2 - M2) ■ t2. 
1=1 
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Phase One, Block b, b e [2 : B]: 



node 


operation at time t 


Tl 


transmits DSs uu{b,t{t)), i e [1 : mi{i{t))]. 


T2 


transmits DSs U2j{b,t{t)), j £ [1 : m2(t{t))]. 


Rl 


receives Yu{t) = LCW(6j(t)) + hd^}{b,t{t)), iVi]. 


R2 


receives Y2j{t) = LC™(6j(t)) + Lc!,'' (6, j G [1 : iV2]. 



Phase Two, Block b, b € [2 : B]: 



node 


operation 


Tl 


transmits DSs |{'"2i(6, i)}™^^^''*^ | and linear combinations ||lC2j' (fe — j- 
At any time, at most A^2 — M2 DSs and A'^i — N2 linear combinations are transmitted. 


T2 


transmits U2j{b,i{t)), j £ [1 : m2{i{t))]. 


Rl 


Gets linear combinations | IlC^^^' (6, | and |{lC^',' (6- 1, t) j""!*'*' | . 
Can decode all DSs sent over Block 6—1. 


R2 


Can decode all DSs sent over this Block. 
Knows the values of linear combinations 1 |lC2j' (6, t)| j- 



TABLE II 

Block b,be [2 : B], of retrospective IA scheme with Shannon feedback 



Design Criterion 5: Choose ti, t2, and mi{t), where t £ [1 : ti], such that 



5Lc(fe)| = £;nreq(t) < (iVl - iV2) • t2. 



t=l 



If these two criteria are satisfied, sets 5ds(^) and 5lc(^) can be partitioned as follows. 
• Partition 5lc(^) into t2 disjoint subsets each of cardinality at most A'^i — so that 

SLcib) = \JVi^c{b,i) and \Vi^c{b,i)\ < A^i - iVa- 



i=l 



Partition 5ds(^) into t2 disjoint subsets each of cardinality at most N2 — M2 so that 

Sm{b) = U ^DS {b, i) and |Pds i) | < iVs - M2. 



1=1 



1 : \VLc{b,i)\ + |Pds 



(24) 



(25) 



Suppose for each i G [1 : t2], 

VLc{b,i)[jVY)s{b,i) = j e 

Over Phase Two of Block h, Tl transmits all the elements of the set 5Lc(ft) 0*^03 while T2 continues to 
transmit DSs intended for R2. See also Tables [1111111 

Block h, Phase Two: Here, t G [b'T + ti + I : h'T + ti + ^2]. 

At time t G [h'T + ti + I : b'T + ti + 12] with 6 G [1 : -B + 1], Tl transmits the elements of set V-Lc{b,t{t) - 
tij [jVTis(b,t{t) - tl) as follows: 



Xu{t) 



p,[b{t),t{t)-tij, ifie 

0, if i G 



1 : 

1 + 



VLc{bit),tit)-h 



+ 



VBs{b{t),tit)-t^ 



VLc{b{t),tit)-tA + VBs{bit),t{t)-t 



: Ml 
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During Phase Two of Block b, T2 transmits DSs |{u2j(?*, if b < B, and remains silent over the 

last Block. Thus, we have the following: for t G [b'T + ti + 1 : b'T + ti + ^2] with b<B, 



and 



X2jit) = 0, i G [m2{t{t)j + 1 : M2 
whereas for t E [BT + ti + 1 : BT + ti + t2] (i.e., over Block B + 1), 

X2{t) = 0. 

This completes the description of the transmission strategy of Tl and T2. 

Consider now the decoding operation starting with R2. The following two lemmas enable an inductive proof that 
R2 can decode all desired DSs. See also Tables imilll 

The next lemma proves that R2 can decode all DSs sent to it over Block 1. 
Lemma 8: At time t = T, R2 can decode DSs 

sent to it over Block 1. 

Proof: Recall that for Block 1, the set 5lc(1) is empty. Thus, over Phase Two of Block 1, Tl transmits all 
elements of the set 5ds(1)> Tl retransmits all DSs |{n2i(l,t)}™^®|_ that are sent by T2 over Phase One 
of this block. Moreover, since \Vds (li ^)| < -^2 — M2, V i, b, Tl, at any time during Phase Two, transmits at most 
A'^2 — M2 elements of the set 5ds(1) = |{'W2i(l5i)}™?/*^|_ ■ This implies that at any time during Phase Two 
of Block 1, at most N2 transmit antennas are active (i.e., they send a non-zero signal). Since the Rayleigh-faded 
channel matrices are full rank almost surely, R2, equipped with A'^2 antennas, can determine the transmit signal 
via simple channel inversion. Therefore, at time t € [ti + 1 : T], R2 can decode all DSs belonging to the set 



1 : m2{i{t)) 



T^DS [b{t),t{t) — tij and also those transmitted by T2 at that time, namely, U2iyl,t{t) 
Therefore, at the end of Block 1, i.e., at t = T, R2 can decode all DSs sent to it over this block. 
The next lemma deals with decoding of DSs sent to R2 over Block b, b £ [2 : B]. 

Lemma 9: Consider Block b, b e [2 : B]. If, at time t = {b — 1)T, R2 has successfully decoded all DSs 

T 



m2{t) 



( _ 1 m2(, 



t=i 



sent to it over Block (6 — 1), then at time t 



bT, it can decode all DSs 



i U2i{b,t) \ 



t=l 



sent to it over Block b. 

Proof: Suppose at time t = {b — 1)T, R2 has decoded successfully all DSs sent to it over Block (6 — 1). Thus, 
R2, at time t = {b — l)T, can determine the values of linear combinations 



LC|J {b-l,t)= H2j2 [{b -2)T + t 



U21 [b 
U22 {b 



l,t) 



{b-l,t) 

M2-m2(?)]xl 



2m2(i) 
0| 



V j G [1 : N2] and t £ [I : h], 



and hence, it can also evaluate 

.[1] 



LC^'j (6 -l,t)= Y2j {b - 2)T + t]- LCgJ (6 - 1, t) , Vj G [1 : N2\ and t G [1 : ti]. 
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Phase One, Block B + 1: No operation is performed. 
Phase Two, Block B + 1: 



node 


operation 


Tl 


tra.iisinits lincsr combinations s "j LC2 ■ y B ^ ^} f f ■ 

I ^ ' -'3=1 Jt=i 


T2 


remains silent 


Rl 


Gets linear combinations'! ^LCgj^ (-B,t)| ' j- 
Can decode all DSs sent over Block B. 


R2 


idle 



TABLE III 

Block B + 1 of retrospective IA scheme with Shannon feedback 



In particular, R2 at time t = {b 



1)T knows values of all elements of set 



5lc(6)= {LcW(6-l,t)}";^ 



t=i 



Consider now the operation over Phase Two of Block b, i.e., for a t G [(6 — 1)T + ti + 1 : (6 — 1)T + ti + t2\. 
Since R2 already knows the values of elements of set 5lc(^)> it can subtract from ¥2(1) the contribution due to 
the elements of set VLc{b,i{t) — ti) C 5lc(^)> which are transmitted by Tl at time t. After this subtraction, from 
the perspective of R2, not more than ^^^^2 transmit antennas are active. Therefore, as mentioned before, R2 can use 
channel inversion to determine the elements of set Pds(^5*(0 — ^i) and also U2i{h,t{t)^ V z € [1 : ?n2(t(t))]. ■ 

Combined with the result of the previous lemma that all Block 1 DSs can be decided at t = T, the above lemma 
can now be applied recursively to show that R2 can decode all desired DSs sent over the first B blocks. Since no 
new DSs are transmitted over the last {B + block, decoding is successful at R2. 

Consider now the decoding procedure at Rl. See also Tables |l III It turns out that Rl, at the end of a given 
block, does not observe a sufficient number of interference-free linear combinations required to decode desired DSs 
sent over that block. However, the missing linear combinations are sent to it over the next block. Hence, DSs sent 
over a given block are decodable at Rl at the end of the next block. Since no new DS is sent over the final block, 
decoding is successful at Rl at the end of Block B + 1. 

The above claims about how decoding works at Rl are proved using a series of three lemmas. The first two 
specify the linear combinations that are known to Rl at the end of each block. The third lemma makes use of the 
first two to prove that all desired DSs are decodable at Rl at the end of Block B + 1. 

Lemma 10: At the end of Block 1, i.e., time t = T, Rl can obtain linear combinations 

LCfJ Vi G [1 : A^i] and t € [1 : ti]. 

Proof: As with R2, Rl at t = T can determine the DSs 

{{u«{uw)}:rr'};i,. 



Hence it can evaluate 



and then, LCfJ = Yu{t) - hC^^^ , Vi G [1 : A^i] and t G [1 : ti]. 



LCgl =Hu2{t) 



U21 
U22 



■2m2(t) 



y[M2-m2{t)]xl, 



, Vi G [1 : Ni] and t G [1 : ti], 
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Lemma 11: At the end of Block b, b E [2 : B], i.e., at t = bT, Rl can obtain the linear combinations 

{{LCW(M)}fjj;' ^ and { {LCW (6 - U) }-.'"};■ ^. 
Further, at the end of Block B + 1, Rl can obtain the linear combinations 

{{LcW(B,i)}-;«}:'_. 

Proof: Consider the operation over Phase Two of Block b, b < B i.e., for t G [b'T + ti + 1 : b'T + ti + t2] 

with h < B. At any time during this phase, at most A^i transmit antennas are active. Hence, via simple channel 
inversion, Rl can determine the transmit signal during this phase. Thus, at the end of this phase, i.e., at t = 
bT, Rl knows the values of elements of the sets 5lc(&) and iSds(&)- This implies that at the end of Block b 

Rl knows linear combinations |{LC2^^(6 — 1, t) }"_r^^*n_ , which are contained in the set 5lc(^)- Further, it 

- t 1 ^ 

knows DSs |{n2j(6, , from which it can compute |{LCf^' (6, i) }^^^| , and subsequently obtain 

{{Lcra(M)}fjj;- . 

The last part of the lemma can be proved using similar arguments. ■ 
Lemma 12: At the end of Block 6, where 6 G [2 : i? + 1], Rl can decode data symbols 

^- 

t=i 

sent to it over Block b — 1. 

Proof: First recall that no new DSs are transmitted to Rl during Block B + 1. Therefore, we will focus on 
Block 6—1, where 6 G [2 : S + 1]. It will be shown that given any t such that b{t) = 6 — 1, all DSs sent by TI 
to Rl at time t can be decoded by Rl at the end of Block 6. Moreover, since no new DSs are sent to Rl over 
Phase Two of any block, we may assume, without loss of generality, that t{t) G [1 : ti]. Thus, in the following, we 
consider a time slot t with b{t) = (6 - 1), 6 G [2 : 5 + 1], and t{t) G [1 : h] or t = {b-2)T + t with t e [1 : h]. 
At this time, TI transmits DSs 

\^uu(b-l,i{t)^Yie [l :mi(i(i)) . 

To decode these DSs, it is sufficient for Rl to know mi (t{t) \ linearly independent linear combinations of these 
data symbols. Consider the following collection of linear combinations: 



{{ 



{lcW(6- and {lcW (6 - 1. t{t) 



i=l 



These are mi (t{t)) linear combinations of DSs {nij (6— 1, }, i G [l : mi , and since the Rayleigh-faded 

channel matrices are full rank with probability 1, these linear combinations are almost surely Unearly independent. 
Hence, if Rl knows the values of these linear combinations, it can decode data symbols transmitted at time t. 
Moreover, by combining the results of previous two lemmas, we observe that Rl can determine the linear 

combinations jLCj^^ (6—1, t{t)) | at the end of Block (6— 1), while it can obtain linear combinations |lC2^' (6— 

l,t{t)] > at the end of Block 6. Thus, at the end of Block 6, Rl can decode all DSs sent to it at time t, 

which belongs to Phase One of Block 6—1. Hence, at the end of Block 6, Rl can decode all DSs sent to it over 
Block 6-1. ■ 

Thus, we conclude that by coding over T* time slots, {d*, dg) DoF can be achieved as desired. This completes 
the description of our generic retrospective interference alignment scheme. 

We will now use this scheme to prove that Po2,3 € under Case A and Po2,i,Pi,3 € under Case B in the 
following three subsections. 
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A. Proof of Po2,3 e under Case A 

Recall that under Case A, Condition 1 holds and 



M[ > N2 



Ni - M2 



■N2-M2' 

and Po2,3 = {di,d2) = {Ni - M2, M2). 

We use the generic retrospective interference alignment scheme with the parameters chosen as follows: 

T = N2, ti = N2- M2, t2 = M2, and m2{t) = M2 G [1 : T]. 

It is easy to verify that this choice satisfies Design Criteria 1 and 2. 
In order to choose mi(t), t € [1 : ti], consider the following: 



h ■ 
h ■ 



N2 
N2 
N2 



Ni 


-M2 


N2 


-M2 


Ni 


-M2 


N2 


-M2 


Ni 


-M2 


N2 


-M2 



> N2 



Ni - M2 
N2-M2 



> 



N2 



Ni - M2 



No 



N1-M2 



N2 



M2. 
N1-M2 



N2-M2 
> N2{Ni - M2) = Tdi > h 



N2-M2 

Ni - M2 



N2 



N2 - M2 



This suggests that we can choose mi(t), t G [1 : ti] as 



mi{t) e 



N2 



Ni - M2 
N2-M2 



N2 



Ni - M2 
N2-M2 



Vt G [1 : ti], such that 



Tdi = iV2(iVi-M2) = 5^mi(t), 



t=i 

where [.xj denotes the largest integer that is less than or equal to x. It can be verified that this choice of parameters 
satisfies Design Criterion 3. Furthermore, Design Criterion 4 holds since (A^2 — M2)t2 = (A^2 — M2)M2 and 
Eti ^2(^) = M2ti = M2{N2 - M2). Next we prove that Design Criteria 5 is satisfied as well. 
First we show that mi{t) > ATj V t G [1 : ti] for which it is sufficient to prove that 

N1-M2' 



N2 



This inequality is proved as follows: 

Ni> N2^ Ni- M2> N2- M2 

N2 ^ m 



N2 - M2 

M2 



> Ni. 



N2 - M2 



+ 1 > 



M2 



> 



Ni - M2 
N2^ ^ > Ni 



Ni-M2^'^ 
Ni - M2 



N2 



N2 - M2 



>Ni, 



N2- M2 ~ Ni- M2 N2 - M2 

as desired. Now, making use of the fact that mi(t) — ATi > V t G [1 : ti], we obtain the following: 

ti ti 



tl 



t=l 



^ |mi(t)| - Nih = Tdi-Niti ■■■ by Design Criteria 3 

• by the choice of ti and T 



t=i 



= N2 (iVi - M2 )-Ni{N2-M2) 
= M2{Ni - N2) = (Ni - N2)t2, 

which proves that Design Criteria 5 is satisfied. Since all Design Criteria 1-5 hold, we know that Po2,3 £ 



B. Proof of Po2,i e under Case B 

Recall that under Case A, Condition 1 holds and 

Here, 

Po2,l = {dl,d2) = [m[ ^''~^\ M^ . 

We use the generic retrospective interference alignment scheme with the following choice of parameters: 

T = N2, h = N2- M2, t2 = M2, 
mi{t) = M[ y t e [1 : h], 
m2{t) = M2 V t G [1 : T]. 

It is easy to verify that Design Criteria 1-3 are satisfied. Design Criterion 4 holds because Y^i^i '^2(i) = ^2 
M2{N2 — M2) = {N2 — M2)t2- Consider now the proof that the last criterion is satisfied. 
Here, nreq(i) = M[ - Ni M t [I : ti]. Hence, 



<SlcW = {K - Ni)ti = {M[ - Ni){N2 - M2). 



Design Criterion 5 because 



M[ < N2— — — • • • by definition of Case B 

^- N2-M2 

M[{N2 - M2) < N2{Ni - M2) = {Ni - N2)M2 + N^{N2 - M2) 
^ {M[ ~ Ni){N2 - M2) < (iVi - N2)M2 
^|5lc(6)| <{Ni-N2)t2. 

Since all Design Criteria hold, Po2,i € under Case B. 



C. Proof of Pi ^3 G under Case B 
Here, 



^, Ni - N2 M( - Ni 



Set 

T = M[- N2, h = Ni- N2, t2 = M[- Ni 
and mi (t) = Mi V ? G [1 -.h]. 

In order to choose m2{i), consider the following argument. 

Mi < iV2 ^'~!!' M[{N2 - M2) < N2iNi - M2) 

1\2 — M2 

=^ N2{M[ - Ni) < M2{M[ - N2) 

M2N1 - M2M[ + N2{M[ -Ni)< M2N1 - M2N2 
M2{Ni - N2) = M2ti > {N2 - M2){M[ - Ni). 

Therefore, we may select m2{i), i e [I : ti], such that 

ti 

< m2{t) < M2 Vt G [1 : h] and 5^m2(t) = (A^2 - M2)(M{ - Ni); 

t=i 

and ma© = M2, e [h + 1 : h + ^2]- 
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It can be easily verified that the above choice of parameters satisfies the first three design criteria. Design Criterion 
4 holds because 

ti 

{N2 - M2)t2 = (M2 - M2){M[ - Ni) m2{t). 

t=l 

Similarly, it is easy to verify that Design Criterion 5 holds. Therefore, we have that that Pi^s G under Case B. 

V. Conclusion 

In this paper, the fast fading MIMO IC is studied under the Shannon feedback setting in which the transmitters are 
assumed to have perfect knowledge of the channel matrices and the channel outputs, both with a finite delay. Under 
such a setting, the DoF region is determined with the proof involving in part the demonstration of a key achievability 
result that in some cases output feedback can improve the DoF region in presence of delayed CSIT. To realize 
the DoF gains attainable with Shannon feedback, this new achievability scheme not only employs all interference 
alignment techniques that are feasible with just delayed CSIT, but in addition, also exploits the additional transmitter 
cooperation that output feedback can induce. This result is further strengthened by identifying scenarios of limited 
Shannon feedback in which the entire Shannon-feedback DoF region is achievable even under the knowledge of 
some of the channel matrices and channel outputs at the transmitters. For example, the three DoF regions under just 
delayed CSIT, just output feedback, and Shannon feedback are proved to be identical for a large class of MIMO 
ICs. Moreover, while this work obtains the DoF region under output feedback for a large class of MIMO ICs, its 
complete characterization and its relationship to the delayed CSIT and Shannon feedback DoF regions remains an 
open problem that merits further investigation. 



Appendix A 
Proof of Lemma [T] 

The region D'^^^'^°p is achievable with just instantaneous CSIT [5]. Thus, it is sufficient to prove that D'^^^'^°p is 
an outer-bound to the DoF region of the MIMO IC with instantaneous CSIT and output feedback. Toward this end, 
recall that the DoF achievable over the point-to-point MIMO channel can not exceed the minimum of the number of 
transmit and receive antennas |25| (this is referred as the 'single-user' bound). This implies that di < m.m{Mi, Ni) 
for each i S {1, 2}. Consider now the bound on di + d2- Even if both transmitters and both receivers are assumed 
to cooperate, the total sum-DoF are limited by Mi + M2 and A'^i + ^^^^2 due to the single-user bound. Therefore, 
di + d2 < min{Mi + M2, Ni + Now, due to symmetry, it is sufficient to show that di-\-d2 < max{Mi, N2). 
The proof of this claim, which makes use of techniques developed in [5|, is given below. 



A. Proof of di + d2 < max(Mi, with Instantaneous CSIT and Output Feedback 
As stated earlier, it is sufficient to prove that di + d2 < max(Mi, 
Lemma 13: For the MIMO IC with i.i.d. Rayleigh fading and iV2 > Mi, 

di + d2< N2 

when there is instantaneous CSIT and output feedback. 

Proof: The proof of this lemma is based on the techniques developed in |5J. This lemma can not however 
be immediately deduced from ||5] Theorem 9] because (i) the model of IC with cooperation studied in ||5] Section 
IV] does not include the case of output feedback considered here and (ii) in the IC considered in Q, the channel 
matrices are time-invariant and deterministic (not fading) and (iii) the IC is known to be not separable in general 



|27| . See Appendix A-B for the complete proof. ■ 
Thus, as per the above lemma, the required inequality holds when N2 > Mi. When < Mi, as argued in 
1 28 1, |[5|, we may add Mi — N2 antennas at R2 (which can not reduce the DoF region), and then apply the above 



lemma to prove that di + d2 < Mi if Mi > N2. Therefore, we together have di + d2 < max(Mi, A''2), as desired. 
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B. Proof of Lemma 13 



This lemma is proved by making use of the techniques developed in fF, Proof of Theorem 9]. 

Let Ui{t) = Hii{t)Xi{t) + Wi{t) and U2it) = H2i{t)Xi{t) + W2{t). Then we have the following corollary, 
which is stated using the notation of Section |II-A| 
Corollary 4 (Lemma 8, ^5J): The following is true: 

Xi{n) ^ Ml, M2,Ui{n-l),U2{n-l),H{ny, 
X2in) ^ M2,Ui{n-l),U2in-l),Hin); 
Fi(n), F2(n) ^ M2, Ui{n), U2{n), H{n), 

where a h denotes the fact that a is a deterministic function of b. 

Proof: Can be proved via induction. ■ 
We now apply Fano's inequality assuming that Rl knows the message M2, and signals f/i(n) and U2{n) to 
obtain 



nRi < l[Mi]Yi{n),Ui{n),U2{n) M2,H{n)j +nen 
= l(^Mi-,Ui{n),U2{n)\M2,H{n)) +nen 

= h(Ui{n),U2{n) M2,H{n)) - h(u i{n) ,U 2{n) X2, Xi, i?H ) + ne. 



where the first equality holds since Yi{n) ^ {M2, Ui{n), U2{n), ff(n)}. Now, following the analysis in ijsj Proof 
of Theorem 9], we get 

n 

h(Ui{n),U2{n)\M2,H{n)') < h(Y2{n)\M2,H{n)') +^h(Ui{t)\u2it),H{n)'^ 

t=i 

h(Ui{t)\u2{t),H{n)) < o(log2P) 
h(Ui{n),U2{n)\M2,Mi,H{n)') > n-o(log2P), 
where o(log2 P) is constant with n. These bounds give 

nRi < h(Y2in) M2,H{n)^ + n ■ o{log2 P) + nen- 
Now Fano's inequality applied at R2 yields 



(26) 



ni?2 < I M2\Yo(n 



< 



H{n) + nen 



h{Y2{n)\H{n)^ - h{Y2{n)\M2,H{n)^ + nen 
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The desired bound can now be derived by adding inequalities in p6| ) and ( p7| ), and subsequently applying the 
single-user bound (cf. ||5j Proof of Theorem 9]). 



Appendix B 
Proofs of Corollaries [I] and [2] 

A. Proof of Corollary |7] 

First consider the case of limited Shannon feedback. Since 

j)/si j)/S2 ^ j^s^ jt sufficient to prove that the 

DoF regions D'Si and D^^^ 

are achievable when there is limited Shannon feedback of Types 1 and 2, respectively. 
In other words, we need to prove that the region D^^^^gj, is achievable under two types of limited Shannon feedback, 
which we do next. Again, assume without loss of generality that Ni> N2. 

If Condition 1 does not hold, then DQ^^(J^ = D'^*~'^^. Then, as pointed out in | ,20J , the region D'^^^^ is achievable 
when for each i G {1, 2}, the i*^ transmitter knows Hji{t) and Hjj{t), j G {1, 2} with j / i, with a delay. Hence, 
when Condition 1 does hold, the region D^^j^^jj. = D'^^^^ is achievable with limited Shannon feedback of Types 1 
and 2. 
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When Condition 1 holds, the region 0^^^^^^, is shown to be achievable with Shannon feedback of both types by 
developing a coding scheme in Section IV It can be verified that this coding scheme works even under limited 
Shannon feedback of both types. 

Hence, together, we have proved that D^^^gj. is achievable with limited Shannon feedback. 

For the setting of designable Shannon feedback: Clearly, C D'*^. Hence, it is sufficient to prove that the 
region D^y^^^^. is an outer-bound, even under designable Shannon feedback. Toward this end, the proof of Theorem 
[T] can be easily modified to apply to the general case of designable Shannon feedback. 



B. Proof of Corollary |2] 

The first part of the corollary follows trivially since the setting of Shannon feedback is stronger than that of 
output feedback. To prove the second part of the corollary, we assume below without loss of generality that 
A^i > and prove that the region = D^y^^^^. is achievable with output feedback, whenever the inequality 
min(Mi,iVi) > N2 > M2 does not hold (note, with A^i > N2, the second inequality in the statement of the 
corollary can never be true). This is the goal of the remainder of this sub-section. 

Here, by no side-information at the transmitters, we mean the setting where the transmitters have no knowledge 
whatsoever of the channel states and the channel outputs; and denote the corresponding DoF region by 0"^° which 
is known from the literature |[9l-|[TT|, |[T3 1. 



Throughput the remainder of this subsection, we assume that the inequality min(Mi, A^i) > A''2 > M2 does not 
hold. Then 

j^no ^ J30P c = D'^^^^ = D^uter ^ D^^S^ = D'^^^^°P. 



From |20 Table I], we observe that D"^" = D^^^-^^^ if the following two inequalities do not hold: 
. Ml > max(iVi,iV2), M2 > iVi, and M2 > N2 (Case A.I.3 in [,20, Table I]); and 
. Ml > max(Ari,iV2) and iVi > M2 > N2 (Case A.II.2 in |[20| Table I]). 

Hence, whenever Cases A.I.3 and A.II.2 do not hold (recall the inequality min(Mi,A'^i) > N2 > M2 is not true), 

then 

j-jno j-jdCSI _^ Y)^° j-)dCSI J])°P Y)"^- 

and thus the corollary holds. 

We now proceed to Cases A.I.3 and A.II.2, under which we want to show that D'^^^^ = D°p. Toward this end, 
for Case A.I.3 and A.II.2, the region D'^^^^ has been shown to be achievable under delayed CSIT by developing 
two lA-based coding scheme in [20, Section VI] and [20, Section VII], respectively. Although these schemes have 
been developed there for delayed-CSIT case, they work even with output feedback. Therefore, using the schemes 



of 1 20 Section VI] and |20 Section VE], we conclude that D^^^si ^ j^op m^^jgj. q^^^^ ^ j 3 ^ jj 2 
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